A New Approach for Scalable Analysis of Microbial Communities
نویسندگان
چکیده
Motivation: Microbial communities play important roles in the function and maintenance of various biosystems, ranging from human body to the environment. Current methods for analysis of microbial communities are typically based on taxonomic phylogenetic alignment using 16S rRNA metagenomic or Whole Genome Sequencing data. In typical characterizations of microbial communities, studies deal with billions of micobial sequences, aligning them to a phylogenetic tree. We introduce a new approach for the efficient analysis of microbial communities. Our new reference-free analysis technique is based on n-gram sequence analysis of 16S rRNA data and reduces the processing data size dramatically (by 10 fold), without requiring taxonomic alignment. Results: The proposed approach is applied to characterize phenotypic microbial community differences in different settings. Specifically, we applied this approach in classification of microbial communities across different body sites, characterization of oral microbiomes associated with healthy and diseased individuals, and classification of microbial communities longitudinally during the development of infants. Different dimensionality reduction methods are introduced that offer a more scalable analysis framework, while minimizing the loss in classification accuracies. Among dimensionality reduction techniques, we propose a continuous vector representation for microbial communities, which can widely be used for deep learning applications in microbial informatics. Availability: The Matlab code and data will be available on: http://llp.berkeley.edu. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.
منابع مشابه
Cost Effective and Scalable Synthesis of MnO2 Doped Graphene in a Carbon Fiber/PVA: Superior Nanocomposite for High Performance Flexible Supercapacitors
In the current study, we report new flexible, free standing and high performance electrodes for electrochemical supercapacitors developed througha scalable but simple and efficient approach. Highly porous structures based on carbon fiber and poly (vinyl alcohol) (PVA) were used as a pattern. The electrochemical performances of Carbon fiber/GO-MnO2/CNT supercapacitors were characteriz...
متن کاملSulfurous Analysis of Bioelectricity Generation from Sulfate-reducing Bacteria (SRB) in a Microbial Fuel Cell
The current importance of energy emphasizes the use of renewable resources (such as wastewater) for electricity generation by microbial fuel cell (MFC). In the present study, the native sulfate-reducing bacterial strain (R.gh 3) was employed simultaneously for sulfurous component removal and bioelectricity generation. In order to enhance the electrical conductivity and provision of a compatible...
متن کاملتشخیص اجتماعات ترکیبی در شبکههای اجتماعی
One of the great challenges in Social Network Analysis (SNA) is community detection. Community is a group of vertices which have high intra connections and sparse inter connections. Community detection or Clustering reveals community structure of social networks and hidden relationships among their constituents. By considering the increase of datasets related to social networks, we need scalabl...
متن کاملDynamic configuration and collaborative scheduling in supply chains based on scalable multi-agent architecture
Due to diversified and frequently changing demands from customers, technological advances and global competition, manufacturers rely on collaboration with their business partners to share costs, risks and expertise. How to take advantage of advancement of technologies to effectively support operations and create competitive advantage is critical for manufacturers to survive. To respond to these...
متن کاملIntelligent scalable image watermarking robust against progressive DWT-based compression using genetic algorithms
Image watermarking refers to the process of embedding an authentication message, called watermark, into the host image to uniquely identify the ownership. In this paper a novel, intelligent, scalable, robust wavelet-based watermarking approach is proposed. The proposed approach employs a genetic algorithm to find nearly optimal positions to insert watermark. The embedding positions coded as chr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1512.00397 شماره
صفحات -
تاریخ انتشار 2015